li2_nc reader daskified #2985

ClementLaplace · 2024-11-20T09:18:59Z

This pull request enables the daskification of the non accumulated and non transformed dataset.

Closes Reading LI L2 point data is not daskified #2814
Test that all the dataset are daskified for the the li_l2_nc reader

codecov · 2024-11-20T09:26:19Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.08%. Comparing base (fd2cec6) to head (fa56be5).
Report is 14 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2985      +/-   ##
==========================================
- Coverage   96.10%   96.08%   -0.02%     
==========================================
  Files         377      377              
  Lines       55147    55155       +8     
==========================================
  Hits        52997    52997              
- Misses       2150     2158       +8

Flag	Coverage Δ
behaviourtests	`3.94% <0.00%> (-0.01%)`	⬇️
unittests	`96.18% <100.00%> (-0.02%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚨 Try these New Features:

Flaky Tests Detection - Detect and resolve failed and flaky tests

coveralls · 2024-11-20T09:39:24Z

Pull Request Test Coverage Report for Build 11934839781

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

For more information on this, see Tracking coverage changes with pull request builds.
To avoid this issue with future PRs, see these Recommended CI Configurations.
For a quick fix, rebase this PR at GitHub. Your next report should be accurate.

Details

5 of 5 (100.0%) changed or added relevant lines in 2 files are covered.
8 unchanged lines in 3 files lost coverage.
Overall coverage decreased (-0.01%) to 96.193%

Files with Coverage Reduction	New Missed Lines	%
satpy/tests/utils.py	2	93.16%
satpy/tests/reader_tests/gms/test_gms5_vissr_l1b.py	3	98.67%
satpy/tests/reader_tests/gms/test_gms5_vissr_navigation.py	3	97.18%

Totals
Change from base Build 11815057382:	-0.01%
Covered Lines:	53241
Relevant Lines:	55348

💛 - Coveralls

mraspaud

LGTM, just a comment or two inline.

satpy/readers/li_l2_nc.py

satpy/tests/reader_tests/test_li_l2_nc.py

ameraner · 2024-11-20T15:26:36Z

So this PR is checking if an array is not daskified and converts it to a dask array if it's not.

I think it would be important to know why the array was not a dask array in the first place, otherwise we are still possibly loading the data into memory maybe unnecessarily. Tracking back the get_dataset in the base class, the variables are taken from the file here

satpy/satpy/readers/li_base_nc.py

Line 409 in fa56be5

return self[vpath]

could you please check if they are not daskified here already, and if so, why? And otherwise, if they're always daskified coming from the file, we should find out where in the code they are computed and turned into numpy...

ClementLaplace · 2024-11-21T12:04:36Z

@ameraner here is the result of my investigation.

The code that you showed me bellow

satpy/satpy/readers/li_base_nc.py

Line 409 in fa56be5

return self[vpath]

returns either non daskified or daskified xarray DataArray.

The reason of the "daskification" or not is to be found there

According to what I have seen all the non daskified array are in fact cached_data which is caused due to the the length of the array* the lenght of dtype_size is inferior to cache_var_size you can see this operation done there

Then some non daskified array are transformed into daskify in this part of the code it is depending of the transformation made.

ameraner · 2024-11-21T12:28:57Z

Hi @ClementLaplace , thanks for the analysis! I see, it's quite as @gerritholl was suspecting :) Ok, then I would say it is ok to keep this implementation, in order to make sure that we have homogenised dask array outputs for dataset.

ClementLaplace added 2 commits November 19, 2024 07:54

feat: Daskify the non accumumulated product for the li_l2_nc reader

1fe920e

fix: Correct the issue related to the test_li_l2_nc

6d83829

test : Verify that all the dataset encapsulate a dask array

9871d1a

ClementLaplace marked this pull request as ready for review November 20, 2024 11:57

ClementLaplace requested review from djhoese and mraspaud as code owners November 20, 2024 11:57

mraspaud assigned ClementLaplace Nov 20, 2024

mraspaud approved these changes Nov 20, 2024

View reviewed changes

satpy/readers/li_l2_nc.py Outdated Show resolved Hide resolved

satpy/tests/reader_tests/test_li_l2_nc.py Show resolved Hide resolved

typo : put space into the line 116 of readers/li_l2_nc.py

fa56be5

mraspaud added bug component:readers labels Nov 20, 2024

mraspaud merged commit 12c05dc into pytroll:main Nov 20, 2024
16 of 18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

li2_nc reader daskified #2985

li2_nc reader daskified #2985

ClementLaplace commented Nov 20, 2024 •

edited

Loading

codecov bot commented Nov 20, 2024 •

edited

Loading

coveralls commented Nov 20, 2024 •

edited

Loading

mraspaud left a comment

ameraner commented Nov 20, 2024

ClementLaplace commented Nov 21, 2024 •

edited

Loading

ameraner commented Nov 21, 2024

li2_nc reader daskified #2985

li2_nc reader daskified #2985

Conversation

ClementLaplace commented Nov 20, 2024 • edited Loading

codecov bot commented Nov 20, 2024 • edited Loading

Codecov Report

coveralls commented Nov 20, 2024 • edited Loading

Pull Request Test Coverage Report for Build 11934839781

Warning: This coverage report may be inaccurate.

Details

💛 - Coveralls

mraspaud left a comment

Choose a reason for hiding this comment

ameraner commented Nov 20, 2024

ClementLaplace commented Nov 21, 2024 • edited Loading

ameraner commented Nov 21, 2024

ClementLaplace commented Nov 20, 2024 •

edited

Loading

codecov bot commented Nov 20, 2024 •

edited

Loading

coveralls commented Nov 20, 2024 •

edited

Loading

ClementLaplace commented Nov 21, 2024 •

edited

Loading